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Parametric audio coding 



The iiwention relates to parametiic audio coding. 

Heiko Pumhagen, "Advances in parametric audio coding', Proc, 1999 IEEE 
Worlzshop on Applications of Signal Processing to Audio and Acoustics, New Paitz, New 
York, Oct. 17-20, 1999 discloses that paiametric modeling provides an efficient 
representation of general audio signals and is utilized in very low bit rate audio coding. It is 
based on the decomposition of an audio signal into components which are described bj^ 
appropriate source models and represented by model parameters (like the frequency and 
amplitude of a pure tone). Perception models are utilized in signal decomposition and model 
parameter coding. 

An object of the invention is to provide an advantageous parameterization of a 
multi-channel (e.g. stereo) audio signal. To this end, the invention provides a metliod of 
encoding, an encoder, an apparatus, an encoded audio signal, a storage medium, a method of 
decoding, a decoder and a receiver or reproduction apparatus as defined in the mdependent 
claims. Advantageous embodiments are defined in the dependent claims. 

It is noted that stereo audio coding as such is known in the prior art. For 
example, the two chamiels left (L) and right (R) may be coded independently. This may be 
done by two independent encoders arranged in parallel or by time multiplexing in one 
encoder. Usually, one can code the two channels more efficiently by using cross-channel 
correlation (and irrelevancies) in the signal. Reference is made to the MPEG-2 audio standard 
(ISO/IEC 13818-3, pages 5,6) which discloses joint stereo coding. Joint steieo coding 
exploits the redundancy between left and right chamiels in order to reduce the audio bit-rate. 
Two fomis of joint stereo coding are possible: MS stereo and intensity stereo. MS stereo is 
based on coding the sum (L+R) and the difference (L-R) signal instead of the left (L) and 
right (R) channels. Intensity coding is based on retaining at high firequencies only the energy 
envelope of the right (R) and left (L) channels. Direct application of the MS stereo coding 
principle in parametric coding instead of in subband coding would result in a parameterized 
sum signal and a parameterized difference signal. The forming of the simi signal and the 
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difference signal before encoding might give rise to the generation of additional frequency 
components in the audio signal to be encoded which reduces the efficiency of the parametric 
coding. Direct application of the uitensity stereo codiag principle on a parametric coding 
scheme would result in a low fiequency part with independently encoded channels and a high 
frequency part that includes only tlie energy envelope of the right and left chanutiels. 

According to a first aspect of the invention, common frequencies are 
determined in the at least two chamiels of the audio signal, which common frequencies occur 
in at least two of the at least two chamiels, and respective sinusoidal components in 
respective chamiels at a given coimiioii frequency are represented by a representation of the 
given connnon frequency, and a representation of respective amplitudes of the respective 
sinusoidal components at tlie given common fr equency. This aspect is based on the insight 
that a given frequency generated by a given source has a high probability to have a 
component ui each of the channels. These signal components will have their frequency in 
common. Tliis is true because signal transfomiations that may occur in the transmission from 
somid source via recording equipment to tlie listener will usually not affect frequency 
components differentially in tlie various or all channels. Thus, common components in the 
various signal channels can be represented by a single, common frequency. The respective 
amplitudes (and phases) of the respective components in the respective channels may differ. 
Thus, by coding the sinusoids with a common frequency and a representation of the 
respective amplitudes, an efficient compressive coding of the audio signal is achieved; only 
one parameter is needed to encode a given common frequency (which occurs in various 
chamiels). Further, such a parameterization is advantageously applied with a suitable psycho- 
acoustic model. 

Once a common frequency has been found, the other parameters describmg 
the components in each respective channel can be represented. For example, for a stereo 
signal tliat is represented with sinusoidal components, the mean and the difference of the 
amphtudes (and optionally the respective phases) can be coded. In a fiarther embodiment, the 
largest amphtude is encoded in the coded audio stream together with a difference amplitude, 
wherein the sign of the difference amphtude may detemdne the dominant channel for this 
frequency. 

Since there is likely to be some degree of correlation between the left and the 
right channels, entiopy coding of the sinusoidal parameters can be used which will result in 
more efficient encoding of the stereo signal. In addition, irrelevant mformation within the 
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common component representation can be removed, e.g. interaural phase differences at high 
frequencies are inaudible and can be set to zero. 

It is possible to encode any frequency occurring in tlie channels as a common 
frequency. If a frequency occuning in one channel does not occur in another channel, the 
amplitude representation should flien be encoded such as to result in a zero amplitude for the 
channel in which the frequency does not occm-. For example if in a multi-channel application 
a frequency occurs in 3 of the 4 channels, then the frequency can be encoded as a common 
frequency while making the amplitude zero in the channel m which the frequency does not 
occur. 

Non-common fr equencies may also be represented as independent suiusoids in 
the respective channels. Non-common frequencies can be encoded in a separate parameter 
block. It is ftirther possible to produce a first parameter block mcluding common frequencies 
wliich common fi equencies ai e common to all channels, a second parameter block wliich 
mcludes fiequencies which are common to a (predetermined) subset of all channels, a third 
parameter block wliich includes fiequencies which are corranon to a fiirther (predetermined) 
subset of all channels, and so on until a last parameter block which includes the frequencies 
wliich occur in only one channel and which are independently coded. 

A common frequency may be represented as an absolute frequency value but 
also as a frequency changing over time, e.g. a first derivative df/dt. Further, the common 
frequencies may be differentially encoded relative to other common frequencies. 

Common fiequencies can be found by estimating frequencies by considering 
two or more chamiels at the same time. 

In a first embodknent, fiequencies are separately determined for the respective 
channels followed by a comparison step to determme the common frequencies. The 
determination of the fi equencies occiuring in the respective channels may be performed by a 
conventional matcliing pursuit (see e.g. S.G. Mallat and Z. Zhang, "Matching pursuits with 
time-frequency dictionaries," IEEE ti'ans, on Signal Processing, vol. 41, no. 12, pp. 3397- 
3415) or peak picking (see e.g. 'R. McAulay and T. Quatieri, "Speech Analysis/Synthesis 
Based on a Sinusoidal Representation," IEEE Trans, ASSP, Vol. 34, No. 4, pp. 744-754, 
Aug. 1986). 

In a second embodiment for deteraiining the common frequencies, a combined 
matcliing pursuit algorithm is employed. For example, respective power or energy 
representations of the at least two channels are combined to obtain a common representation. 
The common frequencies are then deteraiined based on the common representation. 
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Preferably, the power spectra of the at least two channels are added to obtain a common 
power spectrum. A conventional matching pm-suit is used to determine the frequencies in this 
added spectrum. The frequencies foiind m this added power spectrum are determined to be 
common frequencies. 

5 hi a third embodmient for deterrniiiing tlie common frequencies, peak picking 

in added power specti a is used. The fr equencies of tlie maxima that are found in tliis common 
power spectrum can be used as the conmion frequencies. One could also add log-power 
spectra instead of linear power spectra. 

Preferabl)^, the phase of tlie respective components of the common frequency 

10 is also encoded. A coiiunon phase, which may be the average phase of the phases m the 

chamiels or the phase of the chaimel with the largest ampUtude, and a difference phase (inter- 
chamiel) may be included m tlie coded audio signal. Advantageously, the difference phase is 
only encoded up to a given tlireshold frequency (e.g. L5 kHz or 2 kHz). For frequencies 
higher than this tliieshold, no difference phase is encoded. This is possible without reducing 

15 the quaUty significantly, because human sensitivity to mteraural phase differences is low for 
frequencies above tliis tlireshold. Therefore, a difference phase parameter is not necessary for 
frequencies above the given tlueshold. Upon decodmg, the delta phase parameter can be * 
assumed to be zero for frequencies above the tlueshold. The decoder is ananged to receive 
such signals. Above the threshold frequency the decoder does not expect any codes for 

20 difference phases. Because the difference phases are in practical embodiment not provided 
with an identifier, it is important for the decoder to know when to expect difference phases 
and when not. Further, because the hmiian ear is less sensitive to large iateraural iutensity 
differences, delta amphtudes wliicli are larger than a certain threshold, e.g. 10 dB, can be 
assumed infinite. Consequently, also in this case no interaural phase differences need to be 

25 encoded. 

Frequencies in different channels differing less than a given threshold may be 
represented by a common frequency. In this case it is assumed that the differing frequencies 
originate from the same source fi equency. In practical embodiments, the threshold is related 
to the accuracy of the matcliing pmsuit or peak-picking algorithm. 
30 In practical embodiments, the parameterization according to the invention is 

employed on frame-basis. 

The invention is appUcable to any audio signal, including speech signals. 
These and otlier aspects of the invention wiU be apparent from the elucidated 
with reference to the accompanying drawings. 
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In the drawings: 

Fig. 1 shows an encoder accoiding to an embodiment of the invention; 
Fig. 2 shows a possible implementation of the encoder of Fig. 1; 
Fig. 3 shows an altemative implementation of the encoder of Fig. 1, and 
Fig. 4 shows a system according to an embodiment of the invention. 
The drawings only show those elements that are necessary to miderstand the 
embodiments of the invention. 

Fig. 1 shows an encoder 1 1 according to an embodiment of the invention. A 
multi-channel audio signal is input to the encoder, hi this embodiment the multi-chamiel 
audio signal is a stereo audio signal having a left channel L and a rigjit channel R. The 
encoder 1 1 has two inputs: one mput for tlie left channel signal L and another input for the 
right channel signal R. Alternatively, the encoder has one input for both channels L and R 
which are in that case fiimished in a multiplexed form to the encoder 1 1 . The encoder 1 1 
extracts sinusoids from both channels and determines common frequencies fcom- The result of 
the encoding process performed in the encoder 1 1 is an encoded audio signal. The encoded 
audio signal includes the conmion frequencies fcom and per common frequency fcom a 
representation of the respective amplitudes in the respective chaimels, e.g. in the fomi of a 
maximum or average amplitude A and a difference (delta) amplitude AA. 

Li the following, it is described how the common frequencies may be 
determined, a first embodiment employing a matchmg pursuit and second embodunent 
employing peak-picking. 

An embodiment employing 'matching pursuit' 

This method is an extension of existing matching pursuit algoritlmis. Matching 
pursuits are well-known in the art. A matching pursuit is an iterative algorithm. It projects the 
signal onto a matching dictionary element chosen from a redundant dictionary of time- 
frequency waveforms. The projection is subtracted from the signal to be approximated in the 
next iteration. Thus in existing matching pursuit algorithms, the parameterization is 
performed by iteratively determining a peak of the 'projected' power spectrum of a frame of 
the audio signal, deriving the optimal ampUtude and phase corresponding to the pealc 
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frequency, and extiacting the corresponding sinusoid from the frame under analysis. This 
process is iteratively repeated until a satisfactory parameterization of the audio signal is 
obtained. To derive conunon frequencies in a multi-channel audio signal, the power spectra 
of the left and right channels ai e added and the peaks of this sum power spectrum are 
5 detennined. These peak frequencies are used to detemiine the optimal amplitudes and 
optionally the phases of the left and tlie right (or more) channels. 

The multi-chaimel matching pursuit algorithm according to a practical 
embodiment of tlie invention compi ises the step of splitting the multi-channel signal into 
short-duration (e.g. 10 ms) oveiiappmg frames, and applying iteratively the followhig steps 
10 on each of the frames until a stop criterion has been met: 

1 . The power spectra of each of the charmels of the multi-channel frame are calculated 

2. The power spectra are added to obtain a common power spectrum 

3. The frequency at wliich the common 'projected' power spectrum is maximum is 
detemiined 

15 4. For the frequency determined in step 3, for each channel, the amplitude and phase of the 
best matching sinusoid are determmed and all these parameters are stored. These parameters 
ar e encoded using the common frequencies in combination with a representation of the 
respective amplitudes thereby exploiting cross-channel correlations and irrelevancies. 
5 . The smusoids are subtracted from tlie corresponding current multi-channel fr ames to 

20 obtain an updated residual signal which serves as the next multi-channel frame in step 1 . 

Embodiment using 'peak p ickinp;' 

Alternatively, peak picking maybe used, e.g. including tlie following steps: 
25 1 . The power spectra of each of the chaimels of the multi-channel frame are calculated 

2. The power spectra are added to obtain a common power spectrum 

3. The frequencies corresponding to all peaks within the power spectrum are determined 

4. For these detennined frequencies, the best amplitudes and best phases are obtained 

30 Fig. 2 shows a possible implementation of the encoder of Fig. 1, which makes 

use of a common (added) power spectrum of the channels to determine the corrunon 
frequencies. In calculation unit 110 a matching pursuit process or a peak picking process is 
performed as described above by using a common power spectrum obtained from the L and 
R charmels. The determined common frequencies ^om are furnished to coding unit 111. This 
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coding unit determines the respective amplitudes of the sinusoids (and preferably the phases) 
in the various channels at a given common frequency. 

Alternatively, the respective channels are independently encoded to obtaia a 
set of parameterized sinusoids for each chamiel. These parameters are thereafter checked for 
5 common frequencies. Such an embodiment is shown in Fig. 3. Fig. 3 shows an alternative 
implementation of the encoder 11 of Fig. 1 . hi tliis implementation, the encoder 1 1 comprises 
two independent parametric encoders 11 2 and 113. The parameters fu Al and fa, Ar obtained 
in these independent coders are frunished to a fiirther codhig unit 114 which determines the 
common frequencies fcom in these two parameterized signals. 

10 

Example of coding a stereo audio signal 



Assume that a stereo audio signal is given with the following characteristics: 



channel 


f (Hz) 


A(dB) 


f(Hz) 


A(dB) 


f(Hz) 


A(dB) 


f (Hz) 


A(dB) 


f (Hz) 


A(dB) 


L 


50 


30 


100 


50 


250 


40 






500 


40 


R 


50 


20 


100 


60 






200 


30 


500 


35 



15 hi practice, in the case the amplitude difference between the chamiels is +15 dB or -15dB at a 
given frequency', this frequency is considered to occur only in the dominant channel. 

Independently coded 

The following parameterization can be used to code the exemplary stereo signal 
20 independently. 

L(f,A) = (50,30), (100,50), (250,40), (500,40) 
R(f,A) = (50,20), (100,60), (200,30), (500,35) 
This parameterization requires 16 parameters. 



25 Using common frequencies and non-common fi^equencies 

Common frequencies are 50 Hz, 100 Hz and 500 Hz. To code this signal: 
(Fcom,A™x, AA) = (50,30,10), (100,60,-10), (500,40,5) 

(F non-com^ 

A) = (200,-30), (250,40) 
Coding the exemplary stereo audio signal using common and non-common frequencies 
30 requires 13 parameters in this example. Compared to the independently coded multi-channel 
signal, the use of common frequencies reduces the number of coding parameters. Further, the 
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values for tlie delta amplitude are smaller than for the absolute amplitudes as given in the 
mdependently coded multi-chamiel signal. This further reduces the bit-rate. 

The sign in the delta amplitude AA determines the dominant channel (between 
two signals). In the above example, a positive amplitude means that the left channel is 
5 dominant. The sign can also be used in the non-common frequency representation to indicate 
for which signal the frequency is valid. Same convention is used here: positive is left 
(domuiant). It is alternatively possible to give an average amplitude in combination with a 
difference amplitude, or consistently the amplit-ude of a given channel with a difference 
amplitude relative to the other chaimel. 

1 0 Instead of using the sign in tlie delta amplitude AA to determine the dominant 

chamiel, it is also possible to use a bit ia the bit-stream to indicate the dominant channel. This 
requires 1 bit as may also be tlie case for the sign bit. This bit is included in the bit-stream 
and used in the decoder, hi the case that an audio signal is encoded with more than two 
channels, more than 1 bit is needed to indicate the dominant channel. This implementation is 

1 5 straightfonvard. 



Use of only common frequencies 

Wlien only a representation based on common fr equencies is used, the non-common 
frequencies are coded such that the amplitude of the common frequency in the channel in 
20 which no sinusoid occurs at that frequency is zero. In practice, a value of e.g. +15 dB or -15 
dB for the delta amplitude can be used to indicate that no sinusoid of the current frequency is 
present in the given chamiel. The sign in the delta amplitude AA determines the dominant 
channel (between two signals). In tliis example, a positive ampUtude means that the left 
chamiel is dominant. 

25 (Fcom,A, AA) = (50,30,10), (100,60,-10), (200,30,-15), (250,40,15), (500,40,5) 

This parameterization requires 15 parameters. For this example, the use of onl)'^ common 
fi equencies is less advantageous than the use of common and non-common frequencies. 



Frequency average and differences 
30 (Fav, AF,Aav, AA) = (50,0,25,5), (100,0,55,-5), (225,25,35,5), (500,0,30,10) 
This parameterization requires 16 parameters. 

Tliis is an alternative encoding wherein the sinusoidal components in the signal are 
represented by average frequencies and average amphtudes. It is clear that also compared 
with this coding strategy, the use of common frequencies is advantageous. It is noted that the 
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use of average jSrequencies and average amplitudes can be seen as a separate invention 
outside the scope of the current application. 

It is noted that not strictly the number of parameters but rather the sum of the 
5 niunber of bits per parameter is of unportance for tlie bit-rate of the resulting coded audio 
stream. In tliis respect, differential coding usually provides a bit-rate reduction for correlated 
signal components. 

The representation with a common fiequency parameter and respective 
1 0 amphtudes (and optionally respective phases) can be regarded as a mono representation, 

captured in the parameters common frequency, avei age or maximum amplitude, phase of the 
average or maximimi amplitude (optional) and a multi-chamiel extension captured in the 
parameters delta amplitude and delta phase (optional). The mono parameters can be treated as 
standaid parameters that one would get in a mono sinusoidal encoder. Thus, these mono 
15 parameters can be used to create links between sinusoids m subsequent frames, to encode 
parameters differentially according to these Unks and to perform phase continuation. The 
additional, multi-channel parameters can be encoded according to strategies mentioned above 
wliich fruther exploit binaural hearing properties. The delta parameters (delta ampUtude and 
delta phase) can also be encoded differentially based on the linlcs that have been made based 
20 on the mono parameters. Further, to provide a scalable bit-stream, the mono parameters may 
be included in a base layer, whereas the multi-channel parameters are included in an 
enhancement layer. 

In the traclcing of the mono components, the cost function (or similarity 
measure) is a combination of the cost for the frequency, the cost for the amplitude and 

25 (optionally) the cost for tlie phase. For stereo components, the cost function may be a 

combination of the cost for the common frequency, the cost for the average or maximum 
amplitude, the cost for the phase, the cost for the delta ampUtude and the cost for the delta 
phase. Alternatively, one may use for the cost fimction for stereo components: the common 
frequency, the respective amplitudes and the respective phases. 

30 Advantageously, the sinusoid parameterization using a common frequency and 

a representation of the respective amplitudes of that frequency in the respective channels is 
combined with a mono transient parameterization such as disclosed in WO 01/69593- Al 
(Apphcant's reference PHNL000120). This may ftirther be combined with a mono 
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representation for tlie noise such as described in WO 01/88904 (Applicant's reference 
PHISrL000288). 

Although most of the embodiments described above relate to two-channel 
audio signals, the extension to three or more channel audio signals is straightforward. 
5 Addition of an extra channel to an already encoded audio signal can 

advantageously be done as follows: it suffices to identify in the encoded audio signal that an 
additional channel is present and to add to the encoded audio signal a representation of the 
amplitudes of the common frequencies present in the extra channel and a representation of 
the non-connnon frequencies. Phase inforaiation can optionally be mcluded in the encoded 
10 audio signal either. 

In a practical embodiment, the average or maximum ampUtude and the 
average phase of the largest amplitude at a conmion frequency are quantized sunilar to the 
respective quantization of the delta amplitude and the delta phase at the common frequency 
for the other chamiel(s). Practical values for the quantization are: 
15 connnon frequency resolution of 0.5 % 

amplitude, delta amphtude * resolution of 1 dB 

phase, delta phase resolution of 0.25 rad 

The proposed multi-chamiel audio encoding provides a reduction of the bit rate when 
20 compared to encoduig tlie chaimels independently. 



Fig. 4 shows a system according to an embodiment of the invention. The 
system comprises an apparatus 1 for transmitting or storing an encoded audio signal [S]. The 
apparatus 1 comprises an input unit 10 for receiving an at least two-channel audio signal S. 

25 Tlie input unit 10 may be an antemia, microphone, network connection, etc. The apparatus 1 
fiirther comprises the encoder 1 1 as shown in Fig. 1 for encoding the audio signal S to obtain 
an encoded audio signal with a parameterization according to the current invention, e.g. (fcom, 
Aav, AA) or (fcom, Amax, AA), Tlic eucodcd audio signal parameterization is fijrnished to an 
output unit 12 which transforms the encoded audio signal in a suitable format [S] for 

30 transmission or storage via a transmission mediimi or storage medium 2. The system fiirther 
compiises a receiver or reproduction apparatus 3 which receives the encoded audio signal [S] 
in an input unit 30. The input unit 30 extracts from the encoded audio signal [S] the 
parameters (fconi> Aav, AA) or (fcom, Anmx, AA). These parameters are fimiished to a decoder 31 
which synthesizes a decoded audio signal based on the received parameters by generating the 
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common frequencies having the respective amplitudes in order to obtain the two channels L 
and R of tlie decoded audio signal S'. The two channels L and R are furnished to an output 
unit 32 that provides the decoded audio signal S\ The output unit 32 may be reproduction 
unit such as a speaker for reproducing the decoded audio signal S\ The output unit 32 may 
also be a transmitter for further transmitting the decoded audio signal S ' for example over an 
in-home network, etc. 

It should be noted that the above-mentioned embodiments illustrate rather than 
limit tlie invention, and tliat tliose skilled in tlie art will be able to design many alternative 
embodhnents without departing from tlie scope of the appended claims. In the claims, any 
reference signs placed between parentheses shall not be construed as limiting the claun. This 
word ^comprising' does not exclude the presence of other elements or steps than those listed 
in a claim. The invention can be implemented by means of hardware comprising several 
distmct elements, and by means of a suitably programmed computer. In a device claim 
enumeratmg several means, several of tliese means can be embodied by one and the same 
item of hardware. The mere fact that certain measures are recited in mutually different 
dependent claims does not indicate that a combination of these measures cannot be used to 
advantage. 
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CLAIMS: 



1. A method of encoding (1 1) an at least two-channel audio signal (L,R), the 
method comprising: 

deteraiining (110) common frequencies (fcom) in the at least two channels 
(L,R) of the audio signal, wliich common frequencies occur m at least two of the at least two 
5 channels of the audio signal, and 

representing (111) respective sinusoidal components in respective channels at 
a given common frequency by a representation of the given common frequency (fcom) a^id a 
representation of respective amplitudes (A,AA) of the respective sinusoidal components at 
the given common frequency. 

10 

2. A method of coding as claimed in claim 1, wherein the representation of the 
respective amplitudes (A,AA) comprises an average amplitude (A) and a difference 
amplitude (AA). 

15 3. A method of coding as claimed in claim 1 , wherein the representation of the 

respective ampUtudes (A,AA) comprises a maximum amplitude (A) and a difference 
amplitude (AA). 

4. A method of coding as claimed in claim 1, wherein non-common frequencies 
20 are coded as conmion frequencies, wherein the amplitude representation includes an 

indication for indicating the at least one channel in which the frequency does not occur. 

5. A method of coding as claimed in claim 1, wherein in addition to the common 
frequencies, non-common frequencies are coded independently. 

25 

6. A method as claimed in claim 5, wherein the non-common frequencies are 
grouped in the coded audio stream in a separate block. 
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7. A method as claimed in claim 6, wherein the common frequencies are grouped 

and included in the encoded audio signal preceding to tlie block of non-common frequencies. 



8. A method as claimed in claim 6, wherein the parameters of tlie sinusoidal 

components at the common frequencies are included m a base layer and the parameters of the 
smusoids at non-common frequencies in an enhancement layer. 



9. A method as claimed in claim 1, wherein the method compiises the step of 

combioing respective power or energy representations of the at least two channels to obtain a 
10 common representation and wherem the step of determining the conmion frequencies is 
perforaied based on the common representation. 



10. A method as clamied in claim 9, wherein the combining step mcludes adding 

power spectra of the at least two channels and wherein the conmion representation is a 
15 common power spectrum. 



11. A method as claimed in claim 1, wherein frequency and amplitude parameters 

are included in a base layer and the delta amplitude is included in an enhancement layer. 



20 12. A method as claimed in claim 1, wherem respective phases of the respective 

sinusoids at the given common frequency are determined and wherein a representation of the 
respective phases is included in the encoded audio signal. 



13, A method as claimed in claim 12, wherein the representation of the respective 

25 phases includes an average phase and a difference phase. 



14. A method as claimed in claim 12, wherein the representation of the respective 

phases includes a phase of the channel with a largest ampUtude, and a difference phase. 



30 15. A method as claimed in claim 12, wherein the represCTitation of the respective 

phases is only included in tlie signal for sinusoids having a frequency up to a given threshold 
frequency. 
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16. A method as claimed iii claim 1 5, wherein the given threshold frequency is 
about 2 kHz. 

17. A method as claimed in claim 12, wherein the representation of the respective 
5 phases is only included in tlie signal for sinusoids having an amphtude difference with at 

least one of the other chamiels up to a given amplitude threshold. 

IS. A metliod as claimed in claun 17, wherein the given amphtude threshold is 10 

dB. 

10 

19. An encoder (11) for encoding an at least two-channel audio signal (L,R), the 
encoder comprising: 

means (1 10) for determining common frequencies (fcom) in the at least two 
channels (L,R) of the audio signal, wliich common frequencies occur in at least two of the at 
1 5 least two channels of the audio signal, and 

means (111) for representing respective sinusoidal components in respective 
channels at a given common fr equency by a representation of the given common frequency 
(fcom) and a representation of respective amplitudes (A,AA) of the respective sinusoidal 
components at the given common frequency. 

20 

20. An apparatus (1) for transmitting or recording, the apparatus comprising 

an input unit (10) for receiving an at least two-channel (LJR.) audio signal (S), 
an encoder (1 1) as claimed in claim 19 for encoding the audio signal (S) to 
obtain an encoded audio signal ([S]), and 
25 an output unit for providing tlie encoded audio signal ([S]). 

21. An encoded audio signal ([S]) representing an at least two-channel audio 
signal (L5R), tlie encoded audio signal comprising: 

representations of common frequencies (t:om), which common frequencies 
30 represent frequencies wliich occur in at least two of the at least two chaimels of the audio 
signal [S], and 

for a given common frequency (fi;om), a representation of respective ampUtudes 
(A,AA) representuig respective sinusoidal components in respective channels at the given 
common frequency. 
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22. A storage medium (2) having stored thereon a signal as claimed in claim 21. 

23. A method of decoding (31) an encoded audio signal ([S]), the method 
5 comprising: 

receivmg (31) the encoded audio signal ([S]) representing an at least two- 
chaimel audio signal (L,R), the encoded audio signal comprising representations of common 
jfrequencies (fcom), which common frequencies represent frequencies which occur in at least 
two of the at least two channels of the audio signal [S], aad for a given common frequency 
1 0 (fcom)^ a representation of respective amplitudes (A,AA) representing respective sinusoidal 
components in respective chamiels at tlie given common frequency, and 

generating (31) the coimnon frequencies at the respective amphtudes in the at 
least two channels (L,R) to obtain a decoded audio signal (S'). 

15 24. A decoder (31) for decodmg au encoded audio signal ([S]), the decoder 

comprising: 

means (31) for receiving the encoded audio signal ([S]) representing an at least 
tsvo-chamiel audio signal (L,R), the encoded audio signal comprising representations of 
common frequencies (fcom)> which common frequencies represent frequencies which occur in 
20 at least two of the at least two chamiels of the audio signal [S], and for a given common 
frequency (fcom), a representation of respective amplitudes (A,AA) representing respective 
smusoidal components m respective channels at the given conmion frequency, and 

means (31) for generating the conmion frequencies at the respective 
amplitudes in the at least two channels (L,R) to obtain a decoded audio signal (S'). 

25 

25. A receiver or reproduction apparatus (3), the apparatus comprising: 

an input unit (30) for receiving an encoded audio signal ([S]), 
a decoder (3 1) as claimed in claim 24 to decode the encoded audio signal ([S]) 

to obtain a decoded audio signal (S), and 
30 an output unit (32) to provide the decoded audio signal (S). 
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